Saturday, December 18, 2010

SBR600 Wrap up

Final update for:
Supporting Architectures above armv5tel project:

Another wiki page I created:
How to Use Dhrystone

Test Result:

It is proven that armv7 and armv5 arch optimizations provide the same level of performance especially when running C library dependent programs on cdot-beagleXM-0-3 builder.

One probable reason of why the benchmark gave the same results is that the system tested was built to use an ABI called "softfp". Although beagleboardXM (cortex-a8)supports the "hard floating-point" ABI, The Fedora12-ARM system binaries were built to use "soft floating point ABI" which is incompatible with "hard floating point ABI".

I can't technically define what "soft floating point" and "hard floating point" are, but I've read adequate information, enough to give me basic understanding about ABI's and Floating points. ARM Floating point is a pretty big topic, provided are some links to helped you understand more.


In conclusion, Fedora-ARM can continue to use armv5tel codes on armv7 machines. Deciding to recompile the whole universe for armv7 is somehow an inconvenient recommendation for now.

Some final thoughts...

SBR600 is one of the professional option subjects I recommend for students taking CTY/CNS at Seneca. I'll never forget this experience; how it feels like my project is as important as my 1-month food budget! Chris Tyler introduced us to a new level of learning experience.

I can't say much of the technical knowledge I gained from this subject because it's like a "little of everything". Nevertheless, these bits of knowledge can be considered essential to an aspiring linux professional. One important thing I learned is the importance of people skills. I consider this a "must have" personal characteristic; for communication shares a part to the success or failure of a project may it be in school, or in a work place.

Lastly, I want to thank Chris Tyler for guiding me through the project and Paul Whalen for quick help responses; the Fedora community for opening its doors in welcoming students to work on live projects.

It has been such a great semester! Thank you all for your interest in following my project updates!

Monday, November 22, 2010

Dhrystone benchmark results for the cdot-beagleXM-0-3

After a few weeks of research and tests, I finally gathered enough information about the general processor performance of the cdot-beagleXM-0-3 builder by running a benchmark program named Dhrystone.

Chris Tyler recommended that I start my benchmarking with Dhrystone just to measure the general performance of the beagleXM. The goal is simple:

-Run Dhrystone and record the result
-Optimize Dhrystone for armv5tel/armv7 and compare/record the results.

I thought it was going to be as easy as just downloading the program, running it, and reporting the results. It never came to me why Chris recommended Dhrystone as the benchmark program for the project. As I worked through the 0.1 goal of my project, I was able to get a better understanding about Dhrystone; it's limitations and advantages.

Reasons for using Dhrystone:
  • ARM® recognizes the program and uses it as a performance attribute of their processors.
  • Dhrystone provides a more meaningful MIPS (Million Instructions Per Second) because results are compared to a reference machine.
  • Dhrystone numbers reflect the performance of the C compiler and libraries more so than the performance of the processor itself. (considered as a weakness of the program)

Dhrystone’s execution is largely spent in standard C library functions. The compiler and the C library affects Dhrystone benchmark result, making it suitable for measuring the general performance of cdot-beagleXM-0-3.The cdot-beagleXM-0-3 builder currently uses armv5tel glibc standard library. If an optimized Dhrystone on armv5tel glibc beagleXM is as fast or has insignificant difference against an armv7 glibc (on the same machine), there is little or no need to focus on recompiling glibc and a lot of C library dependent programs/packages for cdot-beagleXM-0-3 builder.

Here is a graph showing the results of the Dhrystone benchmark:
Notice how both optimizations (armv5tel/armv7) resulted in the same number of DMIPS.

Legend:
Normal = 758.869322709 DMIPS
Optimized for armv5tel = 1034.82179852 DMIPS
Optimized for armv7 = 1034.82179852 DMIPS

The gcc optimization options used for compiling Dhrystone are both ARM architecture specific. One good thing about the result of the benchmark is that optimization increased the overall performance of cdot-beagleXM-0-3 by 36%. The results for both armv5tel and armv7 optimizations are the same. It's safe to assume that the armv5tel glibc impacts the performance of C library dependent programs such as Dhrystone.

Next Dhrystone run will be for an armv7 glibc. Results can then be compared to the data I gathered so far. I am also looking at recompiling the kernel as some sort of an "application benchmark".

Useful Links:

Dhrystone and MIPs performance of ARM processors
DMIPS vs MIPS
Download Dhrystone 2.1

White Papers/Guides:

  • ECL Dhrystone White Paper
  • Dhrystone MIPS - Criticsm by ARM
  • Dhrystone 2.1 README notes

Thursday, November 11, 2010

Playing around with GIT

"Git is distributed version control system focused on speed, effectivity and real-world usability on large projects." It's a wonderful tool for individual developers to work on codes in a peer-to-peer approach.

I still have yet to try more about Git, but for now I'll show how Git can be used as a tool that can backtrack any progress when working on some local files.

Here's how I played around Git:

I created a parent directory called testplay which contains a directory for the nled package.
All the commands that follow are done inside the testplay directory.

To let Git take revision control of the directory, I typed:
#git init
Git initializes with the message "Initialized empty Git repository in /home/user/testplay/.git/"

So far Git has no idea about the files or "lines" that it will track. In order for Git to have a snapshot of the directory, 2 commands are needed:
#git add .
AND
#git commit

This enables Git to store the snapshot permanently in the repository. Every commit creates a version (When commit is run, Git automatically takes the user to Vi editor for a commit message).

Alternatively, using git commit -a does the same purpose as git add and git commit.

Branches

The snapshot/version I created was made under a "branch" named "master" by default. Any commits will update this branch. Git can have multiple branches. Branches are names that points to a particular commit. Given are the commands used to best explain what branches are and what they do.

Display the branches in the repository
#git branch
In my case, I only have 1 branch so the output shows:
* master (where the "*" tells the user that the master branch is active.)

Create another branch from the master branch
#git branch extend1

Git now has 2 branches. I can now create 2 versions of my project based on these branches. A commit or change in one branch wouldn't affect the other unless merged.

To make a desired branch active (In my case 'extend1'):
#git checkout extend1
Produces a message: "Switched to branch 'extend1'"

I created a random file inside the nled/ directory and ran the git add and git commit. If I go back to my "master" branch, the file I created wouldn't be visible; this approach is very useful for debugging and testing out program compilations. :)


Useful Links

Official Git page
gittutorial page
gitmagic

Thursday, November 4, 2010

Optimizing a package for ARMv7 architecture

This is part of the 0.1 stage of Supporting Architectures above armv5tel project. I need to test an optimized package for BeagleboardXM-ARMv7 architecture and document the results. Packages are needed to be optimized to take advantage of ARMv7 processor features namely:
  • Thumb®-2/ Thumb
  • NEON™
  • VFPv3 Floating Point

I was supposed to test my compile on cdot-beagleboardXM-0-3 yesterday and today, but it’s down because the adapter is loaned out.

Anyway, I’ll just show my compiler optimization options and would appreciate any comments.

Compiling a package using ARM specific optimization option (using GCC 4.4.x):
CFLAGS="-O2 –march=armv7-a –mtune=cortex-a8 –mfpu=neon
–mfloat-abi=hard –fomit-frame-pointer"

*Using -mfloat-abi=hard with VFP coprocessors is not supported. Use -mfloat-abi=softfp with the appropriate -mfpu option to allow the compiler to generate code that makes use of the hardware floating-point capabilities for these CPUs. I would have to use this in any case -mfloat-abi=hard doesn't work

Optimization definition:

-O2
-O2 turns on all -O optimizations and all other optimizations that don't greatly increase binary size or interfere with debugging. -O2 is even better than -O, and usually just as safe. This is the optimization level most commonly used for packages and distributions in the Linux world and for the Linux kernel.

-march=your_arch
-march= tells gcc to optimize for a certain architecture. Basically, you just need to know what your CPU is, and the GCC name for it. This may break compatibility with other architectures!

-mtune=/-mcpu=
-mtune=, or -mcpu in older versions of GCC, is similar to -march and accepts the same options. Unlike -march it doesn't break compatibility with older arches. -march and -mtune/-mcpu options can be mixed to get the desired effect.

-mfpu=name
This specifies what floating point hardware (or hardware emulation) is available on the target.

-mfloat-abi=name
Specifies which floating-point ABI to use. Permissible values are: `soft', `softfp' and `hard'.

Specifying `soft' causes GCC to generate output containing library calls for floating-point operations. `softfp' allows the generation of code using hardware floating-point instructions, but still uses the soft-float calling conventions. `hard' allows generation of floating-point instructions and uses FPU-specific calling conventions.

-fomit-frame-pointer
-fomit-frame-pointer tells gcc to omit frame pointers, freeing up an additional register on the CPU. This is mainly useful on x86 as most other arches, like AMD64, have it on by default at -O2 or greater, though binary size may increase slightly. This flag breaks debugging on x86 and possibly other arches unless you're compiling with gcc 4.x and the -fvar-tracking flag.

Sources:

CFLAGS Definition
GNU Compiler Collection
GCC ARM Options
Armin76's Blog

I would like to extend my thanks to the people on IRC and the community Including Chris Tyler and Paul Whalen for guiding me in this project.

#seneca
#fedora-arm
#ubuntu-arm
#gentoo-embedded


Documentation is still on process, the test results will be posted by next week:

Monday, November 1, 2010

FSOSS 2010 - Day 2 (October 29, 2010)

I wasn't able to blog last Friday (October 29, 2010) due to my sickness. After my volunteer shift around 3:00PM that day, I started feeling lightheaded. I rushed home and skipped my awaited presentation (Paul Whalen's ARM Project). As soon as I got home, I crashed into bed and had a fever later that night.
I'm feeling a little better now. I hope I don't miss tomorrow's SBR600 class due to this.


Here are the presentations I attended to last Friday:

The Business of Linux - How individuals can get in the game
It's an hour talk about the basics of business practices do's and don'ts. It was presented by Ms. Karlie Robinson.

Systemtap - System Wide Probing on Linux: Focus on User Space
I personally think this is an awesome project, but I didn't really get how they presented it. I don't know if I was just not techie enough or it was just too technical. It was presented by Mr. Dave Brolley and his team of 2 guys (names didn't appear on the presenter list)

How CMS architectures affect dev. communities
I was expecting too much for this presentation. I thought I was going to find out which CMS system is the most popular, widely used... etc. Although the presentation only lasted for 30 minutes I still learned about architectures, programming languages used and a little about Content Management System evolution. It was presented by Mr. Julian Egelstaff.


Useful Links:

Karlie Robinson's Blog.
Systemtap

Thursday, October 28, 2010

FSOSS 2010 - Plug Computing Workshop (October 28, 2010)

Since I started in my program, I never missed a single FSOSS event. I can't find any reason not to join; It's cheap, I learn things that for sure won't be available anywhere else, and lastly..FREEBIES!! Of all the FSOSS events, workshops have always been what I like the most. Today I showed up for Chris Tyler's Plug Computing Workshop.

The workshop covered everything from setting up the GuruPlug to creating a personalized image server. I'm convinced on how powerful ARM PCs are, so I'm considering of getting one and probably experiment on it.

As always, workshops include Q&A part. I never had a chance to ask any question but I learned from Chris some important things to know about the GuruPlug and similar ARM based plug computers
  • GuruPlugs have problems with Gigabit Ethernet (slows down for some odd reason)
  • Overheat problems (the reason why it's got a fan in the middle)
  • GuruPlugs consume electricity comparable to a radio clock
Useful Links:
PandaBoard
BeagleBoard
SheevaPlug

Friday, October 22, 2010

The day I learned about Python

Chris Tyler discussed a little bit about Python today for my classmates working on projects that include moderate to heavy scripting. I'm not really good at programming languages but I always find myself interested to learn about it.

As soon as I got home, I hurriedly turned on my PC and wrote my first Hello World in Python. Sir Chris told me last Thursday when I asked him which programming language will stand out the most in the near future, He replied... "Python".
I definitely agree to what he said; Considering some popular software embed/use Python.

Anyway I'm interested in learning more so I downloaded my copy of "Dive into Python".

Regarding the ARMv7 Project, I haven't had a time this week. I still need to catch up on my readings.

Wednesday, October 20, 2010

Details about the ARMv7 Project

I had a great time talking to Chris Tyler yesterday about the ARMv7 Project. He explained the tests needed for the project and briefly contrasted ARMv5 & ARMv7.

My project details include:
-Working on the cdot-beagleXM-0-3 host
-Study the make command and the gcc compiler
-Compile software under ARMv7

Just a quick info:
cdot-beagleXM-0-3 is an ARMv7 board that tests ARMv5 architecture builds. What does that mean?? I can't think of any good analogy but this.. It's like running an old PC game on a NEW computer. It will work, but without the enhanced features on.
With that said, my project will try to test if the technologies in ARMv7 provide significant performance difference against ARMv5.

BeagleBoard XM image


More About ARM and BeagleBoard
BeagleBoardXM
List of ARM processors
Supporting Architectures above armv5tel Project Page

Wednesday, October 13, 2010

Using Mock & Koji

After wasting 2 weeks of trying to build my own spec file from scratch, It's a pain to say that I need more time to fully understand some of the macros used in my privoxy build. privoxy's spec file although short, has lines meant for advanced builders. It's a real challenge for a noob builder like me!! :)

On the brighter side, building a spec for nled walked me through the basics I need. The "How to Create an RPM Package" and the "Maximum RPM" are both good resources.

Using Mock

The "Using Mock to test package builds" page has all the instructions and links needed for a successful mock build (In which I noticed that my build was slower compared to using rpmbuild-ba). I also followed the instructions for setting up SELinux policy for mock.



Using Koji

Using Koji in Fedora is also pretty straight forward. Setting up my certificates and uploading my build didn't take long.

Here's an image of my build status:



Lastly....

I'm putting off my tests for rpms to work on the armv7tel project. I only have a project page draft for now, but I'll make sure to document everything.

Details on my armv7tel project can be viewed here

Wednesday, September 22, 2010

Week 2 (September 13) - RPM Packages I (using rpmbuild)

Objectives:
- Prepare system for rpm packaging by installing core development tools.
- Rebuild/Repackage an existing RPM package using rpmdev tools and play around with the .rpmmacros -j value for each compile.

Preparations for rpm packaging:

In order to successfuly repackage a source rpm, certain tools are needed in my fedora system.

I need all of the following (Install using yum):
rpmdevtools
rpm-build
rpmlint
rpm
yum-utils

Rebuild/Repackage an existing RPM package:

NOTE: None of the steps below require root
STEPS:
1. Once all the required tools are downloaded, I need to create my build directory using the command
#rpmdev-setuptree
The rpmdev-setuptree command will create the rpmbuild directory located at my user's home (~/rpmbuild).

2. Download a source package using yum.
#yumdownloader --source privoxy

3. Install the downloaded source package (which places the package's .spec file into ~/rpmbuild/SPECS and other source and patch files in ~/rpmbuild/SOURCES).
Command: rpm -i pkg*src*rpm
#rpm -i privoxy-3.0.16-2.fc13.src.rpm

4. I then have to go to ~/rpmbuild/SPECS/ directory and check the spec file of the software to be packaged; then run the "rpmbuild" command on the spec file, which will go through a series of steps to produce my packages.
#rpmbuild -ba privoxy.spec

Editing the .rpmmacros file and change the value for -j:

The .rpmmacros file is an invisible file located in user's home directory (~/.rpmmacros)
There's not much I know of this file (I'll study more about it) but as Chris Tyler stated in class, the %_smp_mflags -j3 (where -j3 is the default option)
tells the kernel on how to fast it should go when installing a software.
So I played around with the %_smp_mflags -j3 value. I was expecting that a higher -j value would mean faster build time.....

I was wrong!

Unfortunately for my build, the   -j3 (default) option, is  .565 seconds slower than the -j1 option  in time (user mode). I tested other options up to   -j5 which resulted to slower build times.

I will  test my build on other systems this coming Friday to make sure my observation is correct. The test was ran on a Fedora 13 system (virtual machine) using Windows 7 as a host (AMD dual-core).

UPDATE: (September 26, 2010)

I learned from Chris that the -j1 option in the .rpmmacros file is well suited for single core processors. I proved it true when I checked my virtualbox settings for my guest machine. I have a single core processor setup.

I recompiled the same package, only this time in an amd dual core system. The default value -j4 seems to be well suited for a faster compile.

Monday, September 13, 2010

Lab 1

Last friday was an encouraging class session. Chris Tyler played a couple of FSOSS videos that featured the people behind the fedora community. Watching the video stirred me to think of ways of how to contribute more to the Open Source community. Well, this blog is one of it...

As for the first lab, I installed Truecrypt on my Fedora 13 system. It wasn't a smooth install for I have to resolve some dependencies needed for the code compilation.

Here's how I installed it:

1. I downloaded the source code package: (Linux .tar.gz)
    http://www.truecrypt.org/downloads2


2.  Untar, move to the extracted directory and start compiling.
     tar xvf truecrypt-7.0a-source/
     cd truecrypt-7.0a-source/
     make

3.   The first batch of errors filled my screen that says:
      /home/user/Downloads/truecrypt-7.0a-source/Common/SecurityToken.h:211:error: template argument 4 is invalid

     I took notice of the word "SecurityToken.." googling lead me to RSAlabs.The set of files are part of an API called Cryptoki

     Here are the commands needed to install it:
    
     wget ftp://ftp.rsasecurity.com/pub/pkcs/pkcs-11/v2-20/pkcs11.h
     wget ftp://ftp.rsasecurity.com/pub/pkcs/pkcs-11/v2-20/pkcs11f.h
     wget ftp://ftp.rsasecurity.com/pub/pkcs/pkcs-11/v2-20/pkcs11t.h



4. I stumbled on my 2nd error when I tried to compile again; this time the error reads:

     Assembling Aes_x64.asm
     make[1]: nasm: Command not found
     make[1]: *** [../Crypto/Aes_x64.o] Error 127
     make: *** [all] Error 2

    Before I stumble into more errors, I decided to google and find known  dependencies for Truecrypt.

     yum install nss-pkcs11-devel fuse-devel wxGTK wxGTK-devel

    Then I installed nasm assembler

5.  I recompiled and successfully installed Truecrypt to my system.







Helpful Links:

http://penguinenclave.blogspot.com/2009/07/truecrypt-62-install-guide-for-fedora.html

http://linuxsysadminblog.com/2009/07/install-truecrypt-on-fedora-10/





     


      

Thursday, September 9, 2010

My first Blog

Brief Intro

Hi! My name is Mark, currently on my 7th semester taking up CTY at Seneca College. I will be using this blog to post SBR600 related topics including some Open Source knowledge bits n bytes.

Link to my FAS2 Page: https://fedoraproject.org/wiki/User:Mjeamiguel
Seneca Wiki Link: http://zenit.senecac.on.ca/wiki/index.php/User:Mjeamiguel

my IRC name is mjeamiguel, most of the time I'll be at #seneca under the freenode network:

Here's my quick chat session using Chatzilla, (a  pretty amazing plugin for firefox )